(54) Title: VIRTUAL MULTICHANNEL SPEAKER SYSTEM

(57) Abstract: A virtual multichannel sound system is presented to improve
audio reproduction by statically or dynamically conforming signal processing
to specific speaker characteristics and/or arrangements. According to one such
aspect, one or more dynamic signal processing algorithms driving two or more
speakers are altered in response to the relative physical characteristics or
arrangements of these speakers, where parameter information for these
algorithms is either factory set, user input, or automatically supplied to the
processor. Examples of such relative speaker differences include speaker
spacing or alignment, speaker or enclosure compliance, and enclosure
configuration. Another aspect is to alter the processing algorithms in
response to common speaker characteristics for certain conditions of input
signals. An example of this aspect is to alter the signal processing to improve
bass response as a function of bass content in the signals being presented to
the speakers and speaker size as well as relative speaker position.

VIRTUAL MULTICHANNEL SPEAKER SYSTEM



Background of the Invention 

This invention relates generally to sound reproduction systems and,
more specifically, to the enhancement of multichannel sound reproduction through
improved speaker arrangement and the relation of this arrangement to audio signal
processors and their algorithms.

A number of systems have been proposed for expanding the stereo
image present in stereo source material. These systems employ a number of
techniques and algorithms to expand the stereo image beyond the confines of the
left and right speakers. Such systems have also been adapted to source material
with more than two independent input channels, and for use with more than two
speakers. These find application in computer sound playback, home and car audio
systems, and many other applications based on material from any of the many
computer storage systems, video and audio cassettes, compact discs, FM broadcasts,
and all other available stereo and multichannel media.

The generic stereo or two output channel arrangement of the prior
art is shown in Figure 1. A listener 10 is positioned some distance D away from the
midpoint between a pair of speakers 13 and 14. This midpoint is taken as the origin
of the reference coordinates (x,y), with the X-axis extending as shown toward the
primary listening area. In a general placement, each of the speakers 13 and 14 will
be a different distance from the listener 10 and, in particular, a different distance
from each of the listener's ears 11 and 12. The signals to the right speaker 14 and
the left speaker 13 are supplied from an audio signal processor 17 along lines 16
and 15, respectively. The signal processor produces the output signals along 15 and
16 based upon the audio signals input from lines 18. In the case of a 2 input, 2
output, or 2-2, signal processor, there are only two input lines 18.
In the simplest case, the signal processor is absent and a pair of input
lines 18 from a stereo audio source are then the same as lines 15 and 16 and there
is no enhancement of the stereo signals. When a signal is transmitted from a single
speaker, say the right speaker 14, the listener identifies the location of the speaker
as (x_r,y_r) based on the difference between what is perceived at the right ear 12 and
what is perceived at the left ear 11. This difference in perception is due, firstly, to
the difference in path lengths between the right speaker and the right ear, d_rr, and
between the right speaker and the left ear, d_rl, and to a difference in audio level.
The path difference produces a corresponding delay in the signal at the left ear, as it
must propagate the additional distance Δd_r = d_rl - d_rr. But there are also additional
effects: these arise as the head of the listener 10 is not acoustically transparent to
the sound waves and will alter them as they propagate around the head to the left ear
11. This filtering effect is described in terms of Head Related Transfer Functions
(HRTFs). This combination of signal delay and alteration as perceived by the listener
contributes to how the source of the sound is identified as being at the point (x_r,y_r).
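
As an illustration only, the path-length difference and the resulting delay
described above can be estimated from the geometry of Figure 1. The sketch
below is not part of the disclosed processor. The function name, the ear and
speaker coordinates, the sign convention for the y direction, and the speed of
sound of 343 m/s are all assumptions chosen for the example, and the filtering
by the head (the HRTFs) is ignored.

```python
import math

# Assumed value: speed of sound in air at roughly room temperature.
SPEED_OF_SOUND_M_S = 343.0


def interaural_delay(speaker_xy, left_ear_xy, right_ear_xy):
    """Return (path difference in metres, delay in seconds) at the left ear.

    The path difference is d_rl - d_rr: the distance from the speaker to the
    left ear minus the distance from the speaker to the right ear.
    """
    d_rl = math.dist(speaker_xy, left_ear_xy)
    d_rr = math.dist(speaker_xy, right_ear_xy)
    delta_d = d_rl - d_rr
    return delta_d, delta_d / SPEED_OF_SOUND_M_S


# Hypothetical layout in metres: origin at the speaker midpoint, x toward the
# listener, right speaker 0.2 m to one side, ears 0.18 m apart at x = 1.0 m.
delta_d, delay_s = interaural_delay((0.0, -0.2), (1.0, 0.09), (1.0, -0.09))
print(f"path difference {delta_d:.4f} m, delay {delay_s * 1e6:.1f} microseconds")
```

A source on the center line gives a path difference of zero, which is why a
pure delay model cannot by itself distinguish positions along that line.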

To produce a sound that the listener will perceive as being located
at an arbitrary point (x,y), a speaker 19 would ideally, but impractically, be placed
at each such position (x,y). To produce the sounds across the entire front field of
the listener, such as is desired for home theater, computer games, or many other
uses, would therefore require a vast number of speakers and a corresponding
number of independent signals for this surround sound or multichannel effect. To
mimic this effect, the psycho-acoustical mechanisms that allow the listener to fix
the location of a sound source can be exploited through delay and HRTFs.

A number of different algorithms exist for this purpose and are
widely known in the art. Examples and sources include Dolby Laboratories,
Q-Sound Corporation, Spatializer Corporation, Aureal Semiconductor, Harman
International, and SRS True Surround. These would then be employed inside the
signal processor 17 to produce the output signals on lines 15 and 16 for speakers
13 and 14. There may be more than two input signals, for instance in the case of a
5.1 home theater system, which employs left, right, and center forward channels as
well as left and right surround channels. These algorithms rely upon
encoding/decoding schemes to create a spatial representation of recorded materials,
allowing them to place the sound at the perceived location (x,y) of a virtual speaker
19 without requiring a physical speaker at this location.

These signal processing algorithms employ delay, HRTFs, inter-aural
crosstalk cancellation, and other methods known in the field of binaural
hearing using two speakers. A generic example of such a prior art signal processor
is shown in Figure 2 as a block diagram for the case of two input signals 18. For
a signal L entering the left input channel of 17, this signal is also supplied to the
right output channel at the adder 28 after going through the inverter 22 and having
its amplitude diminished and delayed by block 25. By including this out of phase,
delayed, and diminished version of the signal L in the right output signal R' and
transmitting it to the right speaker in addition to supplying the signal L to the left
speaker, the perceived source of the sound is de-localized from the left speaker. A
similar process, based on inverter 21 and block 24, produces a signal from the right
input R that adder 27 combines with L to form output signal L' that de-localizes
signals from the right channel. By further incorporating HRTFs into blocks 24 and
25, along with similar processing in the blocks 23 and 26, it is possible to simulate
the psycho-acoustic stimuli of multichannel or surround stereo with only a pair of
speakers. Additionally, by a proper construction of HRTFs, variations in the
vertical position, a suppressed z direction in Figure 1, may also be mimicked.
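
The cross-feed structure of Figure 2 can be sketched in a few lines, under
simplifying assumptions. In this sketch the direct-path blocks 23 and 26 are
treated as pass-through, blocks 24 and 25 are reduced to a plain gain and an
integer sample delay, and no HRTF filtering is applied. The function name and
the default gain and delay values are hypothetical and are not taken from any
of the commercial algorithms named above.

```python
def cross_feed(left, right, gain=0.5, delay_samples=3):
    """Return (L', R') for two equal-length lists of samples.

    Each output is the direct signal plus an inverted, attenuated and delayed
    copy of the opposite input, as in the inverter, block and adder chain of
    Figure 2.
    """
    if len(left) != len(right):
        raise ValueError("left and right must have the same length")

    def delayed(signal):
        # Prepend zeros, then truncate to keep the original length.
        return ([0.0] * delay_samples + list(signal))[:len(signal)]

    right_delayed = delayed(right)
    left_delayed = delayed(left)
    left_out = [l - gain * rd for l, rd in zip(left, right_delayed)]
    right_out = [r - gain * ld for r, ld in zip(right, left_delayed)]
    return left_out, right_out


# A single impulse on the left input appears unchanged in L' and as a
# delayed, inverted, half-amplitude impulse in R'.
l_out, r_out = cross_feed([1.0, 0.0, 0.0, 0.0, 0.0], [0.0] * 5,
                          gain=0.5, delay_samples=2)
print(l_out)
print(r_out)
```

The appropriate gain and delay depend on the speaker spacing and listener
position, which is exactly the dependence on placement discussed in the
following paragraphs.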

Although these algorithms as embodied in a signal processing circuit
can be effective in enhancing stereo reproduction to produce virtual multichannel
or surround sound, there are a number of shortcomings. A primary one of these is
inherent in the algorithms themselves: to produce the output signals L', R' from
the input signals L, R requires a number of assumptions to be made about both the
location of the speakers 13 and 14 as well as the actual speakers themselves. For
the various processing blocks 23, 24, 25, and 26 to provide the correct delays,
HRTFs, and so on requires the algorithm to assume a particular speaker separation
and alignment modeled on point-like speakers. It must also make a series of
assumptions about speaker response, particularly about the differential response of
one speaker relative to the other.
As these assumptions are built into the signal processor, it is
important that the speakers are spaced correctly and, preferably, slightly above the
listener. For the proper psycho-acoustical response, the physical speaker separation
is more important than the Y location of the listener, with the listener's X position
even less critical. Users frequently place speakers in an arbitrary manner for any
number of practical or aesthetic reasons, because the size or purpose of the correct
physical separation is not known, or based on the incorrect assumption that a wider
physical separation produces a better result. Additionally, for some computer
monitors and other uses, the speakers are often fixed, but in a position that may be
incorrect as the algorithm used may have been based on the speaker position of, say,
a car. These defects undermine the algorithm at the core of the signal processor and
are a serious limitation in the prior art.

The alignment, or azimuthal angle, of the speaker axis also affects
the sound received by the listener. The above example of speaker placement in a
car compared to that in a home computer system is also illustrative of this problem:
car speakers are often placed in the doors of the automobile where the sound will
come from the listener's sides, while personal computer applications usually place
the speakers to the front of the listener. Aside from any change in relative delay or
amplitude this may cause, these two placements will require different HRTFs as the
sound will propagate around the listener on a different path. Even with the
alignment of the application for which the algorithm was designed, aligning one
speaker askew to the other speaker will create another differential response that will
undermine the algorithm.

The assumptions about the speakers themselves include idealizing
them as having the same response to a given input signal. Whether through using
improperly matched speakers, differences in how they are connected, or even
manufacturing variations, actual speaker pairs will, to one degree or another, have
relative variations. Such variations will not only degrade the enhanced stereo
algorithms described above, but also more "traditional" or non-enhanced stereo
reproduction. Some of the more basic differences resulting from differences in
things such as speaker or enclosure compliance can be addressed by balance
controls or graphic equalizers, but these are not concerned with the sort of dynamic
signal processing, related to phase or other such parameters, such as is used for
virtual speaker placement.

One method known in the art for improving such enhanced stereo
schemes is to employ one of the matrix encoding-decoding processes known in the
literature for creating a spatial representation of recorded material, examples
including ProLogic, Circle Surround, and Logic 7. Such schemes are dependent on
special source material encoding. Generically, these processes start with n distinct
sound channels that are matrix encoded into l channels for an n:l encoding. At the
reproduction stage, these l channels are then subjected to l:m matrix decoding to
produce m output signals. Aside from other shortcomings, these algorithms still
suffer from the need for proper speaker placement, but now have the additional
complication that the signal processor must be able to handle the proper decoding
scheme, which may or may not be compatible with other input material for the
processor.
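
To make the generic n:l encoding and l:m decoding concrete, the following
sketch applies two small matrices to one sample from each channel. The
coefficients are invented for illustration, with n = 3, l = 2 and m = 3. They
are not the coefficients of ProLogic, Circle Surround, Logic 7, or any other
real scheme, and the helper name apply_matrix is hypothetical.

```python
def apply_matrix(matrix, channels):
    """Multiply a matrix (list of rows) by a vector of channel samples."""
    return [sum(c * x for c, x in zip(row, channels)) for row in matrix]


# Hypothetical 3:2 encoder: inputs (L, R, C) are folded into two channels,
# with the center mixed equally into both at reduced level.
ENCODE_3_TO_2 = [
    [1.0, 0.0, 0.7],
    [0.0, 1.0, 0.7],
]

# Hypothetical 2:3 decoder: left and right are passed through and the center
# is estimated as the average of the two transmitted channels.
DECODE_2_TO_3 = [
    [1.0, 0.0],
    [0.0, 1.0],
    [0.5, 0.5],
]

source = [1.0, 0.0, 0.0]            # signal present in the left channel only
transmitted = apply_matrix(ENCODE_3_TO_2, source)
recovered = apply_matrix(DECODE_2_TO_3, transmitted)
print(transmitted)
print(recovered)
```

With these made-up coefficients a signal that was only in the left channel
also appears at half level in the decoded center, a simple instance of the
imperfect channel separation that passive matrix decoding can introduce.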

One way to overcome some of these limitations is, of course, to
introduce more independent sound channels and the corresponding speakers, as is
done for instance in the Dolby Digital, Sony SDS, or DTS 5.1 channel cinema
sound recording or Direct X computer game sound. All of these examples employ
a pair of rear channels to provide stereo sound from the back. Although this may
improve sound from the rear to produce a more realistic representation, it still
leaves the previous limitations for the more important front sound channels.
Additionally, although the psycho-acoustic localization of sound from the rear is
less acute than from the front, the inclusion of rear speakers now introduces all of
the speaker placement problems inherent in enhanced stereo algorithms to rear
speakers as well as the front, though less critically so.

Similarly, such a multichannel or matrix sound system would benefit
from an increase in the number of actual speakers, although a method would be
needed to produce the signals suitable for these extra speakers. Once again, proper
placement of these speakers is needed for the best results.
Therefore, one objective of the present invention is to reduce these
limitations by presenting an audio signal processor responsive to information on
speaker placement and response. A second objective of the present invention is to
reduce these limitations in such a manner as to not require intentional pre-encoding
of the source material and is, therefore, of immediate use and applicability to
current stereo recordings. Such improvements would also have applicability for
producing virtual multichannel enhanced stereo as well as for non-enhanced,
conventional multichannel sound.

Other objectives are to present a speaker mechanism that holds the
speakers in a set spatial relationship, either fixed or adjustable to each other and
including a sensor mechanism to provide data about this relationship and other
relative speaker information. A further objective is to use this information to effect
variation in the algorithm employed by the audio signal processor.

An additional objective of the present invention is to extend these
other objectives beyond two channel stereo to matrix or multichannel audio systems
by extending the same techniques to rear sound channels, and, furthermore, by such
an application to produce a virtual rear center channel when only a left and right
rear channel signal are provided.

A further object is to use such algorithms to provide audio signals
to an even greater number of speaker pairs to flood an enclosed listening space with
sounds from a greater number of directions.

Summary of the Present Invention

These and additional objects are accomplished by the various aspects
of the present invention, wherein, briefly and generally, audio reproduction is
improved by statically or dynamically conforming the signal processing to specific
speaker characteristics and/or arrangements. According to one such aspect, one or
more dynamic signal processing algorithms driving two or more speakers are
altered in response to the relative physical characteristics or arrangements of these
speakers, where parameter information for these algorithms is either factory set,
user input, or automatically supplied to the processor. Examples of such relative
speaker differences include speaker spacing or alignment, speaker or enclosure
compliance, and enclosure configuration. Another aspect is to alter the processing
algorithms in response to common speaker characteristics for certain conditions of
input signals. An example of this aspect is to alter the signal processing to improve
bass response as a function of bass content in the signals being presented to the
speakers and speaker size as well as relative speaker position.

Additional objects, advantages, and features of the present invention
will become apparent from the following description of its preferred embodiments,
which description should be taken in conjunction with the accompanying drawings.

Brief Description of the Drawings 

Figure 1 shows a prior art stereo arrangement.

Figure 2 is a block diagram for an example of a prior art signal
processor.

Figure 3 shows a preferred embodiment of some aspects of the
present invention.

Figure 4 is a block diagram for a signal processor in Figure 3.

Figure 5 is a block diagram of these aspects applied to a personal
computer.

Figure 6 shows a speaker enclosure described in the text and its
relation to a video monitor.

Figure 7 is a flow chart for determining the correct choice of
algorithm in a discrete embodiment of the present invention.

Figure 8 shows two embodiments of the invention for an audio source
with rear sound channels.

Figure 9a shows a 5.1 channel home sound system as commonly
arranged in the prior art.

Figure 9b shows a 5.1 channel home sound system employing one
aspect of the present invention.

Figure 10 shows another embodiment with four signal processors
and four sets of speakers.

Figure 11 shows an additional embodiment with four signal
processors and two sets of speakers.

Description of the Preferred Embodiments 

An embodiment of the present invention uses single driver speakers
to improve spatial imaging by eliminating crossover network manufacturing
variations, in an arrangement with automatic adjustment of the digital signal
processing algorithm based on the speaker spacing as sensed by the special speaker
housings and connecting sleeve. Another aspect allows
information on speaker spacing to be factory set or input by the user so that the
signal processor may still be used with a pair of speakers not connected in a way
that automatically provides this information. Conversely, a further aspect is a
speaker enclosure that uses two single driver speakers in identical housings joined
by a mechanism that enables the spacing between the speakers to be set to match
the width of the underlying supporting surface, such as a TV or computer monitor,
so that the spacing can be optimized.

Figure 3 shows several aspects of the present invention in this
embodiment. As in Figure 1, a listener 10 is located in front of a pair of speakers
13 and 14. The speakers are separated by a distance s from each other with their
midpoint a distance D from the listener. This midpoint is taken as the origin of the
reference coordinates (x,y), with the X-axis extending as shown toward the primary
listening area. The speakers 13 and 14 again receive the respective input from lines
15 and 16 and the initial audio information comes in on a number of lines 18.
Unlike the prior art, the speakers are now in an enclosure 30 holding the matched
speakers 13 and 14 in special housings with a joining mechanism that allows
adjustment of the speaker spacing. This joining mechanism contains sensors to
determine this physical separation s of the speakers and supply this information on
output line 31. The Digital Signal Processor (DSP) 37 can now adjust its
processing algorithms in response to this input 31. Provision for the algorithms to
be adjusted according to other automatic or manual inputs 32 is also included.
Figure 4 corresponds to Figure 2, but with these parameter inputs 31 and 32 shown
attached to processing blocks 23-26.

This embodiment overcomes many of the limitations found in the
prior art. Using matched speakers reduces relative variations in speaker and
enclosure response as these are now identical within manufacturing tolerances. By
placing the speakers in special housings 30 with a connecting sleeve, they are held
at the proper spacing and azimuthal alignment for the algorithms used in the DSP
37. That this is, in fact, the proper spacing is ensured by the speaker enclosure 30
supplying, along output 31, information on this spacing, to which the DSP 37 will
automatically adjust its algorithms. As DSP 37 will now automatically adjust its
algorithms to the spacing of the speakers, the enclosure allows the separation to be
adjusted to user preferences and not permanently fixed. Other embodiments could
measure relative speaker distance by other methods. Individual speakers with
optical or sonar ranging can be employed to measure and supply the speaker's
distance to the DSP 37.

The embodiment of Figure 3 removes or minimizes many of the
relative variations that undermine the effectiveness of multichannel sound
reproduction as described in the background section. The inputs 31 and 32 allow
for adjustments, either automatic or manual, to modify the signal processor
algorithms to compensate for others. In the embodiment of Figure 3 and other
embodiments below, only the speaker spacing is given as an explicit input
parameter as this is both an important example and is easily discussed and shown
in the figures. More general embodiments may employ a higher dimensional space
of input parameters. For example, the signal processor described above may be
employed with a pair of speakers not in the described enclosure. In this case,
variations in speaker and enclosure compliance, differences in enclosure
configuration, and azimuthal alignment of speaker axes could also be entered into
the algorithms in addition to inter-speaker separation. Preferably these and other
parameters used for dynamic processing adjustments are made automatically
through input 31, although manual input 32 allows them to be entered along with
other information such as choice of matrix decoding scheme. The option of manual
input allows the signal processor to be used with prior art speakers.

By using the automatic supply of parameters, such as inter-speaker
separation s in the embodiment of Figure 3, this aspect of the present invention
allows for the automatic dynamic processing of input signals to drive the speakers
based on parameters determined by the relative characteristics of the speakers. The
actual parameters may be either static, such as speaker spacing, or dynamic, such
as speaker compliance. A familiar prior art example of parameters that may be
altered is the combination of volume and balance controls: the volume control is
an input common to both channels which sets the overall loudness, while the balance
control determines the relative loudness of the two channels. The balance is an
example of a parameter based on relative characteristics. The sort of processing
variations under consideration here are dynamic alterations in the processing
algorithms affecting properties such as the phase of the signals within the processor.
Aside from applications for enhanced stereo employing HRTFs and other
enhancement methods, standard multichannel sound reproduction could also benefit
from these techniques to offset problems due to those relative speaker differences
and placement problems.

As discussed above in the Background, it is this proper physical
speaker separation for a processor's algorithm that largely determines the
effectiveness of that algorithm: it is more important than the listener's Y position
or the even less critical X position. To exactly position the location of speakers 13
and 14, they would, as an idealization, be point sources. For this reason, one
preferred embodiment employs a single driver speaker for each of 13 and 14. Since
it is physically impossible to move the amount of air needed for low frequencies
with small drivers, this results in a trade off between maximizing the effectiveness
of the stereo enhancement of the DSP 37 and the frequency response of larger
and/or multiple speakers. Another standard solution to this problem is to employ
a separate subwoofer for low frequencies to exploit the psycho-acoustical effect that
these low frequencies cannot be localized as well as higher frequencies. This may
be realized with a ported enclosure for bass.

Another solution to the lack of bass response for smaller speakers is
an aspect of the present invention that can be incorporated within the embodiment
of Figure 3 or other embodiments. This would also involve automatic dynamic
processing of the input signals within the signal processor, but now to improve bass
response based upon speaker size as well as relative speaker position. By driving
the speakers in unison, the effective bass response is improved since, functioning
together, they can move a larger quantity of air. Above a chosen frequency, the
individual signals would maintain the values they would have without the
incorporation of this aspect. Below a second lower frequency, say 100 Hz, both
channels would be provided the same output signals with the same phase. In
between these two frequencies, the individual signals would transition between
these two states in a smooth manner, so that there would be no abrupt change at the
transition frequencies. The transition frequencies and characteristics
could be chosen based on speaker characteristics combined with the de-localization
effect of lower frequencies. In this way, a digital signal processor may be used as
a crossover network with phase adjustment to enable using single or multi-driver
speakers more effectively for virtual 3D and other sound applications.
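
One way to picture the two transition frequencies is as a frequency-dependent
blend weight applied to each frequency component of the left and right
signals. The sketch below is an assumption-laden illustration rather than the
disclosed implementation: the raised-cosine shape of the transition, the upper
frequency of 200 Hz, the use of the average of the two channels as the common
low-frequency signal, and both function names are choices made for the
example. Only the lower frequency of 100 Hz comes from the text.

```python
import math


def bass_blend_weight(freq_hz, f_low=100.0, f_high=200.0):
    """Return 1.0 below f_low, 0.0 above f_high, and a smooth value between."""
    if freq_hz <= f_low:
        return 1.0
    if freq_hz >= f_high:
        return 0.0
    t = (freq_hz - f_low) / (f_high - f_low)
    # Raised cosine: continuous and flat at both ends of the transition band.
    return 0.5 * (1.0 + math.cos(math.pi * t))


def blend_component(left, right, freq_hz, f_low=100.0, f_high=200.0):
    """Blend one frequency component (real or complex) of each channel.

    With weight 1 both outputs equal the common signal (same amplitude and
    phase); with weight 0 the inputs pass through unchanged.
    """
    w = bass_blend_weight(freq_hz, f_low, f_high)
    common = 0.5 * (left + right)
    return (1.0 - w) * left + w * common, (1.0 - w) * right + w * common


for f in (50.0, 150.0, 300.0):
    print(f, bass_blend_weight(f), blend_component(1.0, 0.2, f))
```

Averaging the channels means that bass content which is out of phase between
left and right cancels in the common signal. A practical design would have to
decide how to treat such content, which this sketch does not address.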

The described invention can be used to advantage in any of the
applications for enhanced stereo. These include the home audio uses of rendering
surround sound from stereo and matrix stereo sources, such as records, reel-to-reel
and cassette tapes, VHS video cassettes, compact discs (CDs), Laserdiscs, or DVDs,
and car and RV audio rendering from stereo media such as tape, radio broadcasts,
CDs, or VHS video cassettes. For illustrative purposes, the next part of the
discussion will, however, largely focus on computer sound playback from any of
the standard sources. To simplify the figures and discussion, these again mainly use
speaker separation as the single input parameter, although the other parameters
described above and in the following may be included in other embodiments.
Additionally, although the signal processor DSP 37 is a digital device, analog
techniques could also be utilized in other embodiments.

In this context of a PC, Figure 5 shows a block diagram of a
preferred embodiment. The audio source 40, such as a PC sound card, supplies a
left and right signal on lines 18 to the DSP 37. As these may be encoded by any
number of the standard schemes available, the DSP 37 will also include the
corresponding decoding process in connection with its virtual multichannel
algorithms. To allow, as a sub-aspect of the present invention, the use of DSP 37
with a standard pair of powered speakers, input 32 allows for the physical speaker
separation to be input manually. In a more general embodiment, other
information, say, related to room acoustics, such as distance to rear or front walls,
reverb, speaker response, variations in HRTFs, or choice of decoding algorithm,
could also be supplied at input 32. As shown, however, the preferred embodiment
does supply the modified left and right signals L' 15 and R' 16 to their respective
speakers 13 and 14. The data on the separation of the speakers is given to the DSP
37 from the speaker enclosure along line 31. In response to this input, the
processing algorithm is adjusted for the speaker separation s, so that L' = L'(s) and
R' = R'(s).

Figure 6 shows another sub-aspect of the present invention in the
preferred embodiment described above. The speaker enclosure is shown as 30, 30',
and 30" adjusted to respective separations s, s', and s". By having the two single
drivers in matched housings, relative compliance and alignment variations are
minimized. The enclosure joins them by a mechanism that enables the spacing
between the speakers to be set to match the width of the underlying supporting
surface, typically a TV or computer video monitor. The joining mechanism
contains sensors to enable the DSP algorithm to be optimized for the specific
spacing. It also serves several practical purposes. The first of these is that of
keeping the separation of the speakers within the optimal range for stereo
enhancement algorithms, which is somewhat larger than the width of the listener's
head. Another is that it will place the speakers in a better vertical alignment,
namely, even with or slightly higher than the listener. Finally, it solves the problem
of where to place the speakers, a practical difficulty that is often the cause of
incorrect speaker placement, by transferring them from the desktop or other
valuable area to a space normally not used.

Although the discussion so far has implicitly assumed that the
speaker geometry is continuously adjustable and that the algorithms would
correspondingly be continuously variable in response, in the preferred embodiment
this is not the case. To have the DSP algorithms continuously adjustable would
require a more complicated and, consequently, more expensive implementation.
Instead, the preferred embodiment has the algorithm set for a number of discrete
values for speaker spacing. By including enough different values, this serves as a
practical compromise between cost and complexity. These preset values can be set
for a number of standard speaker spacings, say 14 inches, 17 inches, and so on,
corresponding to popular monitor sizes on top of which the enclosure would be
placed. The DSP could then determine by a look up table, a predetermined table of
constants, and/or other processing variables which of the discrete algorithms is
appropriate for the spacing range into which the speakers fall.

WO 00/76266 PCT/US00/13415

Figure 7 shows a flow chart for a simplified example of the process.
At step 100, the value of s is provided. This can be provided automatically, as in
the preferred embodiments described, or entered manually by the user. For the
cases described below with more than one pair of speakers, s would be a vector
containing the various relative separations of the speakers. At step 110, the value
range into which s fits is determined. This is chosen to be one of a set of ranges
corresponding to spacing values appropriate to the application. In this example,
three ranges corresponding to 14, 17, and 21 inches are used: for s < 15", an
algorithm based on 14" is used in step 114; if 15" ≤ s < 19", an algorithm instead
based on 17" is used in step 117; and when 19" ≤ s, step 121 uses an algorithm based
on a 21" separation. Any of the standard enhanced stereo algorithms appropriate to
these values could then be employed.
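
The range test of Figure 7 amounts to a small lookup. A minimal sketch,
assuming the spacing is supplied in inches as a single number and that each
discrete algorithm can be identified by its nominal design spacing, is given
below. The function name and the returned dictionary are hypothetical, while
the thresholds and step numbers follow the example in the text.

```python
def select_design_spacing(s_inches):
    """Map a measured or user-entered spacing to the Figure 7 branch.

    Returns the nominal spacing (in inches) the chosen algorithm is based on
    and the flow chart step that selects it.
    """
    if s_inches <= 0:
        raise ValueError("speaker spacing must be positive")
    if s_inches < 15:
        return {"design_spacing_in": 14, "step": 114}
    if s_inches < 19:
        return {"design_spacing_in": 17, "step": 117}
    return {"design_spacing_in": 21, "step": 121}


for spacing in (13.5, 15.0, 18.9, 19.0, 24.0):
    print(spacing, select_design_spacing(spacing))
```

For systems with more than one pair of speakers, where s is a vector of
separations, the same lookup could be applied to each element in turn.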

A variation on the above embodiments is the case of the speakers in
a constant relationship to each other. The virtual multichannel algorithm can then
be conformed to this fixed difference. In this way, an algorithm with parameters
for this specific configuration may be incorporated into a circuit for use with a
specified speaker configuration, thereby allowing these enhancement parameters to
be factory set.

Other aspects of the present invention incorporate such algorithms
in the production of signals for rear speakers, which, in one embodiment, also use
a speaker enclosure to provide for automatic adjustment of a digital signal
processing algorithm. These aspects can be used with sources which provide rear
audio signals and also to provide a virtual rear center channel for 5.1 channel home
cinema and other applications. Further extensions are aspects that apply these
signal processors and speaker enclosures to produce audio signals for side speakers
to increase sound immersion. The inclusion of side speakers allows for a smoother
transition between front sourced sounds and rear sourced sounds in addition to the
more accurate placement of sound to the sides.

A number of personal computer audio sources have a provision for
rear sound channels. Figure 8a shows such a situation where the audio source 40
now has left and right rear signals on lines 65 and 66 to respective speakers 63 and
64. The front audio channels are as before in Figure 5. This allows the use of DSP
37 and speaker enclosure 30 for the front channels, where the listener's ability to
localize a sound is more acute, while taking advantage of the provided rear channel
signals. It should be noted that although the figures refer to powered speakers,
since these are common in the personal computer examples being used, other
embodiments need not use these and could employ other means for amplification.

Figure 8b is a preferred variation of the arrangement of Figure 8a.
Even though hearing from the rear is less highly localized by the listener, including
a second DSP for the rear, DSP_S 67, will produce a virtual multichannel surround
sound environment from that direction. This embodiment will employ a speaker
enclosure 60 with input 61 back to DSP_S 67 for the rear for automatic adjustment
of DSP_S's algorithm, just as the front speaker enclosure 30 does for the front
channel processor, now labeled DSP_N 37. To further improve the sound
environment, as the sound waves will propagate around the listener differently from
the rear than from the front, the preferred embodiment will employ HRTFs
appropriate to a rear speaker position in DSP_S 67. Although Figure 8b shows the
front enclosure 30 and rear enclosure 60 with the same spacing, this is just for
illustrative purposes as these spacings are independent and need not be the same. A
unified embodiment could combine DSP_S 67 and DSP_N 37 into a single unit taking
both inputs 18 and inputs 68 from audio source 40 as well as the inputs 31 and 61
from respective enclosures 30 and 60.

An embodiment intermediate between Figures 8a and 8b is also
possible, where DSP_S 67 is employed, but with speakers 63 and 64 not contained
in an enclosure 60 and information on rear speaker geometry now from input 62.
This could be due to practicalities of speaker placement or to save on equipment
costs. Additionally, any of these variations on Figure 8b could use the
separation between the front and the back speaker pairs to modify the algorithms
in DSP_S 67 and DSP_N 37 to optimize the sound environment based on this
additional input.

Moving away from the generic example discussed in terms of a PC
embodiment, the use of an arrangement enabling adjustment of the speaker spacing
with automatic adjustment of the DSP algorithm can be applied to the more specific
example of home theater sound systems. Figure 9a shows a prior art arrangement
for a 5.1 channel system. This provides for 5 channels of audio sound, with the 1
referring to a non-directional low frequency channel. These five channels are
distributed among left, center, and right front channels with respective speakers 71,
72, and 73, and left and right rear, or surround, channels with respective speakers
74 and 75. One aspect of the current invention is employed in a preferred
embodiment shown in Figure 9b. Speakers L_S 74 and R_S 75 are now in enclosure
76 connected to DSP 77 in the manner described above with respect to Figures 5
and 8b. This will now produce a virtual multichannel sound environment for the
rear or surround channels, and can produce a virtual center rear channel to
correspond to or complement the actual front center channel. An embodiment
intermediate between Figures 9a and 9b is again possible, using DSP 77 but with
separate speakers L_S 74 and R_S 75 not in a single enclosure 76, information on the
geometry of these speakers being input at 78.

Returning to the PC example of an audio source with two front and
two rear output signals, Figures 10 and 11 present embodiments of two further
aspects of the present invention which employ four DSPs. Even with the virtual
multichannel enhancement of the present invention applied to both front and rear
channels as in Figure 9b, there may still be a large physical gap between the front
speaker enclosure 30 and the rear enclosure 60. Representation of sound from the
listener's sides will not be as realistic as from placement of actual speakers to the
listener's left and right. A preferred embodiment for such an arrangement is shown
in Figure 10.

Figure 10 starts from the arrangement of Figure 8b, but then adds on
two additional speaker enclosure/DSP pairs: DSP_E 82 and enclosure 84 to the right,
or east, to produce sound from speakers 86 and 88, and DSP_W 81 and enclosure 83
to the left, or west, to produce sound from speakers 85 and 87. DSP_E 82 and DSP_W
81 receive their input from both front and rear channels. This use of multiple two
speaker enclosures will flood the enclosed listening space and produce a smoother
transition between front and rear sound location as well as better definition of side
source sounds. As with the front and rear signal processors, DSP_E 82 and DSP_W
81 will preferably employ HRTFs appropriate for their relation to the listening area.
Although the four pairs of speakers are shown in enclosures 30, 60, 83, and 84,
other embodiments could replace any or all of these with just a generic pair of
speakers such that any two adjacent speakers in a configuration constitute a two
speaker pair.

Figure 10 shows one preferred embodiment among many variations.
As with Figure 8b, one variation could then combine DSP_S 67 and DSP_N 37 into a
single front/back unit, with DSP_E 82 and DSP_W 81 into a second left/right unit.
Another is to combine the four DSPs 37, 67, 81, and 82 into a single device with
four audio inputs for receiving audio data from a 4-channel audio source 40, four
pairs of speaker outputs, and an input from each of the four speaker enclosures in
addition to any manual inputs. Other variations would involve replacing some or
all of the speaker enclosures or DSPs with prior art versions in the ways described
above for rear surround speakers. Although this deprives the invention of many of
its advantages, the inclusion of additional side speakers with a prior art DSP would
still make it possible to improve front-rear transitions and side sourced sounds
better than an arrangement which lacked these speakers. For any of these variations,
a further variation would also include additional provisions for the relative position of
speaker pairs in addition to the relative position of individual speakers within a
given pair.

One particular environment where the use of side speakers is
common, and which would benefit from the DSPs of the invention allowing the
physical speaker separation to be input to optimize their algorithms, is in
automobiles. The appropriate adaptation of an arrangement such as Figure 10 to
automotive sound systems could greatly improve their perceived sound
reproduction, where choice of the appropriate input can be made automatic by
coding the wiring harness of different models or through other mechanisms. As
with signals from the rear, these side signals would also have HRTFs appropriate
to their relation to the listener.

An embodiment of an aspect of the current invention again 
employing four DSPs 37, 67, 81, and 82, but only two speaker enclosures 30 and 
60, is shown in Figure 11. Again, this should be compared to Figure 8b, of which
it is an extension. The DSPs receive their inputs the same as in Figure 10, but now 
these signals are summed and returned to only the front pair of speakers 13 and 14 
and the rear pair of speakers 63 and 64. The inputs from enclosures 30 and 60 to 
the DSPs 37, 67, 81, and 82 are suppressed to simplify the drawing. 

Adders 91-94 combine signals from the side DSPs with the front and
rear DSPs. For example, the left front signal on 15 is now the sum of the left signal
from the front DSP 37 and the right signal of the left DSP 81. The result is more
wrap around to the sides. The resultant signals are given by:

L = k_{1a} L_N + k_{1b} R_W
R = k_{2a} R_N + k_{2b} L_E
L_s = k_{3a} L_S + k_{3b} L_W
R_s = k_{4a} R_E + k_{4b} R_S.



The ks are constants introduced to allow the relative amplitudes to be varied
according to the acoustic environment or other needs. For example, in the
symmetric situation shown in Figure 11 placed in a symmetric environment, the
choice k = 1/√2 for all of the ks gives a symmetric output for symmetric adder
inputs and results in unit output amplitude for unit adder input amplitudes. This
will have much the same advantage as the arrangements discussed with respect to
Figure 10, but in situations where the additional speakers are not desirable or
practical.
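
The adder stage of Figure 11 can be illustrated with a short Python sketch. It rests on several assumptions that are not in the text: the function names `combine` and `rms` are hypothetical, a single shared k replaces the eight separate constants, and the DSP outputs are passed as plain lists of samples keyed by names such as "LN" (left output of the front DSP) and "RW" (right output of the west DSP). The sketch also assumes the two signals entering each adder are uncorrelated, which is the case in which k = 1/√2 keeps the RMS level of the sum equal to that of each input.

```python
import math


def combine(dsp_out, k=1 / math.sqrt(2)):
    """Sum side-DSP outputs into the front and rear speaker feeds.

    dsp_out maps 'LN', 'RN', 'LS', 'RS', 'LE', 'RE', 'LW', 'RW' to
    equal-length lists of samples. One shared k is an assumption made
    here; the text allows a separate constant for every term.
    """
    def add(a, b):
        return [k * (x + y) for x, y in zip(dsp_out[a], dsp_out[b])]

    return {
        "L": add("LN", "RW"),   # left front:  front DSP left + west DSP right
        "R": add("RN", "LE"),   # right front: front DSP right + east DSP left
        "Ls": add("LS", "LW"),  # left rear:   rear DSP left + west DSP left
        "Rs": add("RE", "RS"),  # right rear:  east DSP right + rear DSP right
    }


def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))
```

For instance, feeding two orthogonal unit-RMS test sequences such as [1, -1, 1, -1] and [1, 1, -1, -1] into one adder yields a sum whose RMS level is again 1, whereas two identical inputs would sum to a level of √2.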

Various details of the implementation and method are merely
illustrative of the invention. It will be understood that various changes in such
details may be within the scope of the invention, which is to be limited only by the
appended claims.




WHAT IS CLAIMED IS: 

1. A method for modifying the acoustic effect of an array of two
or more speakers responsive to a plurality of audio input signals from one or more
signal processors, wherein each of said speakers is comprised of one or more
acoustic transducers, comprising:

providing one or more parameters derived from the physical
relational characteristics of said speakers; and

using at least one of said parameters to modify said audio input
signals.

2. The method of claim 1, wherein said physical relational 
characteristics include the distances between said two or more speakers. 

3. The method of claim 1, wherein said two or more speakers 
are in the same enclosure. 

4. An apparatus comprising: 

two or more speakers, each of said speakers comprised of one or 
more acoustic transducers; 

a measuring device to determine one or more physical relational 
characteristics of said speakers;

an input circuit that provides one or more parameters derived from 
said physical relational characteristics of said speakers to said signal processors; and 
at least one signal processor to apply acoustic processing to signals 
driving said plurality of acoustic transducers, wherein said acoustic processing is 
responsive to at least one of said parameters.

5. The apparatus of claim 4, wherein said speaker array 
comprises two speakers arranged in an enclosure which holds them a specified 
separation distance apart, and wherein said specified separation distance is fixed.

6. The apparatus of claim 4, wherein said speaker array
comprises two speakers arranged in an enclosure which holds them a specified 
separation distance apart, and wherein said specified separation distance is 
adjustable. 

7. The apparatus of claim 4, wherein said speaker array 

comprises: 

a first speaker assembly placed in front of a listening area; and
a second speaker assembly placed to the rear of said listening area. 

8. The apparatus of claim 7, wherein said speaker array further 

comprises: 

a third speaker assembly placed to the left of said listening area; and 
a fourth speaker assembly placed to the right of said listening area. 

9. The apparatus of either of claims 7 or 8, wherein said 
physical relational characteristics include the relative locations of each of the two 
speaker assemblies. 

10. A speaker assembly, comprising two fixed speakers mounted 
in a predetermined position with respect to each other, wherein each of said 
speakers includes one or more acoustic transducers, said speakers being responsive 
to a plurality of audio input signals from one or more signal processors, wherein
said audio input signals are derived based on fixed input parameters determined by
said predetermined position.

11. A speaker system comprising two speaker assemblies, a first 
one of said speaker assemblies mounted in front of a listening area and a second one
of said speaker assemblies mounted behind said listening area, wherein each of said
assemblies comprise two fixed speakers mounted in a predetermined position with
respect to each other, wherein each of said speakers includes one or more acoustic
transducers, said speakers being responsive to a plurality of audio input signals 
from one or more signal processors, wherein said audio input signals are derived 
based on fixed input parameters determined by said predetermined positions. 

12. A speaker system comprising four speaker assemblies, a first 
one of said speaker assemblies mounted in front of a listening area, a second one 
of said speaker assemblies mounted to the rear of said listening area, a third one of 
said speaker assemblies mounted to the right of said listening area and a
fourth one of said speaker assemblies mounted to the left of said listening
area, wherein each of said assemblies comprise two fixed speakers mounted in a
predetermined position with respect to each other, wherein each of said speakers
includes one or more acoustic transducers, said speakers being responsive to a
plurality of audio input signals from one or more signal processors, wherein said
audio input signals are derived based on fixed input parameters determined by said
predetermined positions. 

13. A speaker enclosure for providing enhanced two channel 
sound in response to a first and a second input signal, said enclosure comprising: 

a pair of speakers, wherein a first of said speakers is responsive to 
said first input signal and a second of said speakers is responsive to said second 
input signal; and

an enclosure for said pair of speakers to hold them in a specified 
physical relation, wherein said enclosure provides data based on said specified 
physical relation as an output signal. 

14. The speaker system of claim 13, wherein said specified 
physical relation is adjustable. 

15. The speaker system of claim 14, wherein said speakers are 
essentially identical.

16. The speaker system of claim 15, wherein each of said
speakers comprise a single acoustic transducer. 

17. The speaker system of claim 15, wherein said specified 
physical relation comprises the distance between said speakers. 

18. A method of processing audio signals for use in a plurality 
of speakers for the acoustic display of aural information, wherein some of the 
reproduced aural information appears to a listener to emanate from a virtual source 
which is spaced from the speakers, comprising:

receiving a plurality of audio signals;

providing one or more input parameters, wherein at least one of said
input parameters is derived from the relative physical characteristics of said
plurality of speakers;

producing a plurality of enhanced output signals for use in said
speakers, wherein said output signals are derived from said plurality of audio
signals in response to said one or more input parameters. 

19. The method of claim 18, wherein said relative physical 
characteristics include the relative position of said plurality speakers. 

20. The method of claim 18, wherein said relative physical 
characteristics include the relative alignment of said plurality speakers. 

21. The method of claim 18, wherein said relative physical 
characteristics include the relative compliance of said plurality speakers.

22. The method of claim 18, wherein said plurality of speakers
are contained within enclosures, and wherein said relative physical characteristics
include the relative compliance of said enclosures.

23. A sound reproduction system comprising:
a first speaker array comprising: 

a pair of essentially identical speakers each comprising a single
acoustic transducer, wherein a first of said speakers is responsive to a first input
signal and a second of said speakers is responsive to a second input signal; and

an enclosure for said pair of single driver speakers to hold them in
a specified physical relation, wherein said enclosure provides data based on said
specified physical relation as an output signal; and

a signal processor for providing said first and second input signals
comprising:

an audio input circuit to receive a plurality of audio signals;
a parameter circuit to receive one or more first input parameters,
wherein at least one of said first input parameters is derived from said output signal
provided by said first speaker array; and
an output circuit coupled to said parameter circuit to provide said
input signals, wherein said input signals are derived by said output circuit from said
plurality of audio signals in response to said one or more first input parameters.

24. The sound reproduction system of claim 23, wherein said 
specified physical relation is adjustable. 

25. The sound reproduction system of claim 24, wherein said 
specified physical relation includes the distance between said pair of isolated 
speakers. 

26. A sound reproduction system, wherein some of the
reproduced sound appears to a listener to emanate from a virtual source which is
spaced from the speakers, comprising:

a first speaker array comprising:
a plurality of speakers, wherein said speakers are responsive to one
or more of a plurality of input signals; and

a signal processor for providing said plurality of input signals
comprising:

an audio input circuit to receive a plurality of audio signals;
a parameter input circuit to receive one or more first input
parameters, wherein at least one of said first input parameters is derived from the
relative physical characteristics of said first speaker array; and

an output circuit coupled to said parameter input circuit to provide
said plurality of input signals, wherein said input signals are enhanced signals
derived by said output circuit from said plurality of audio signals in response to said
one or more first input parameters.

27. The sound reproduction system of claim 26, wherein said 
first speaker array further comprises an enclosure for holding said speakers in a
specified physical relation.

28. The sound reproduction system of claim 27, wherein said
plurality of speakers is a pair of essentially identical speakers each comprising a
single acoustic transducer.

29. The sound reproduction system of claim 26, wherein said
relative physical characteristics include the relative position of said plurality
speakers.

30. The sound reproduction system of claim 29, wherein said
first speaker array further comprises:

a locator to provide data derived from the spatial relation of said first
speaker array; and

wherein said relative position of said plurality speakers is derived
from said data.

31. The sound reproduction system of claim 26, wherein said
relative physical characteristics include the relative alignment of said plurality 
speakers. 

32. The sound reproduction system of claim 26, wherein said 
relative physical characteristics include the relative compliance of said plurality 
speakers. 

33. The sound reproduction system of claim 26, wherein said 
plurality of speakers are contained within enclosures and wherein said relative 
physical characteristics include the relative compliance of said enclosures. 

34. The sound reproduction system of either of claims 25 or 29, 
wherein said first speaker array is located in front of said listener, further
comprising:

a plurality of rear speakers located to the rear of said listener,
wherein said rear speakers are responsive to one or more of a plurality of rear input
signals;

a rear audio input circuit to receive said plurality of audio signals;
and

a rear output circuit to provide said plurality of rear input signals,
wherein said rear input signals are derived by said rear output circuit from said
plurality of audio signals.

35. The sound reproduction system of either of claims 25 or 29, 
wherein said first speaker array is located in front of said listener, further
comprising:

a rear speaker array comprising:
a plurality of rear speakers located to the rear of said listener,
wherein said rear speakers are responsive to one or more of a plurality of rear input
signals; and

a rear signal processor for providing said plurality of rear input
signals comprising:

a rear audio input circuit to receive said plurality of audio signals;

a rear parameter input circuit to receive one or more rear input
parameters, wherein at least one of said rear input parameters is derived from the
relative physical characteristics of said rear speaker array; and

a rear output circuit coupled to said rear parameter input circuit to
provide said plurality of rear input signals, wherein said rear input signals are
derived by said rear output circuit from said plurality of audio signals in response
to said one or more rear input parameters.

36. The sound reproduction system of claim 35, wherein at least
one of said first input parameters and said rear input parameters are derived from
the relative position of said rear speaker array with respect to said first speaker array.

37. The sound reproduction system of claim 35, wherein said 
front and rear signal processors are combined in a single circuit. 

38. The sound reproduction system of claim 35, wherein said 
rear input signals are enhanced multichannel signals.

39. A sound reproduction system for providing the acoustic 
display of aural information to a listener, wherein some of the reproduced aural
information appears to said listener to emanate from a virtual source which is
spaced from the speakers, comprising:

a pair of front speakers placed in front of said listener, wherein a first
of said front speakers is responsive to a first front input signal and a second of said
front speakers is responsive to a second front input signal;

a pair of rear speakers placed to the rear of said listener, wherein a
first of said rear speakers is responsive to a first rear input signal and a second of
said rear speakers is responsive to a second rear input signal; and

at least one signal processor to supply said front and rear input
signals, wherein said input signals are enhanced signals derived by said at least one 
signal processor from a plurality of audio signals in response to the relative physical 
characteristics of said speakers. 

40. The sound reproduction system of claim 39, further 

comprising: 

a pair of left speakers placed to the left of said listener, wherein a
first of said left speakers is responsive to a first left input signal and a second of said
left speakers is responsive to a second left input signal; and

a pair of right speakers placed to the right of said listener, wherein
a first of said right speakers is responsive to a first right input signal and a second
of said right speakers is responsive to a second right input signal; and

wherein said at least one signal processor additionally supplies said
left and right input signals.

41. The sound reproduction system of either of claims 39 or 40,
wherein said relative physical characteristics comprise the relative positions of said 
speakers. 

42. A method of processing audio signals for providing first and 
second output signals for use in a pair of essentially identical speakers for the 
acoustic display of aural information, the first of said speakers responsive to said 
first output signal and the second of said speakers responsive to said second output
signals, comprising:

receiving a plurality of audio signals;

providing one or more input parameters, wherein at least one of said
input parameters is derived from the physical characteristics of said pair of
speakers; and

producing said output signals, wherein above a first frequency, said
first and second output signals are independent, and in a range of frequencies below
said first frequency, said first and second output signals are the same, and wherein
said first frequency is derived by said output circuit from said plurality of audio 
signals in response to said one or more input parameters. 

43. The method of claim 42, wherein said physical characteristics
includes the size of said speakers. 

44. The method of claim 42, wherein said output signals are 
enhanced signals. 